Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task
نویسندگان
چکیده
Cross-lingual tasks are especially difficult due to the compounding effect of errors in language processing and errors in machine translation (MT). In this paper, we present an error analysis of a new cross-lingual task: the 5W task, a sentence-level understanding task which seeks to return the English 5W's (Who, What, When, Where and Why) corresponding to a Chinese sentence. We analyze systems that we developed, identifying specific problems in language processing and MT that cause errors. The best cross-lingual 5W system was still 19% worse than the best monolingual 5W system, which shows that MT significantly degrades sentence-level understanding. Neither source-language nor targetlanguage analysis was able to circumvent problems in MT, although each approach had advantages relative to the other. A detailed error analysis across multiple systems suggests directions for future research on the problem.
منابع مشابه
Comparing Multiple Approaches to the Cross-Lingual 5W Task
Cross-lingual tasks are especially difficult due to the compounding effect of errors in language processing and errors in machine translation (MT). In this paper, we present an error analysis of a new cross-lingual task: the 5W task, a sentence-level understanding task which seeks to return the English 5W's (Who, What, When, Where and Why) corresponding to a Chinese sentence. We analyze systems...
متن کاملThe 5W Structure for Sentiment Summarization-Visualization-Tracking
In this paper we address the Sentiment Analysis problem from the end user’s perspective. An end user might desire an automated at-a-glance presentation of the main points made in a single review or how opinion changes time to time over multiple documents. To meet the requirement we propose a relatively generic opinion 5Ws structurization, further used for textual and visual summary and tracking...
متن کاملThe Why, Who, What, How, and When of Patient Engagement in Healthcare Organizations: A Response to Recent Commentaries
متن کامل
Wh-Questions\' Expression in Persian-speaking Children: A Comparison Between Spontaneous and Elicited Probes
Objectives: Studies have shown that most children before the age of 5 are capable to comprehend and express wh-questions in daily conversations. This study aimed at comparing the ability of wh-questions’ production in 4- to 6-year-old children in spontaneous and elicited conditions. Methods: In this descriptive-analytic study, 4- to 6-year-old Persian-speaking children (N = 72) were selected r...
متن کاملGiveme5W: Main Event Retrieval from News Articles by Extraction of the Five Journalistic W Questions
Extraction of event descriptors from news articles is a commonly required task for various tasks, such as clustering related articles, summarization, and news aggregation. Due to the lack of generally usable and publicly available methods optimized for news, many researchers must redundantly implement such methods for their project. Answers to the five journalistic W questions (5Ws) describe th...
متن کامل